Node Classification in Graph Data using Augmented Random Walk

نویسندگان

  • Hossein Rahmani
  • Gerhard Weiss
چکیده

Node classification in graph data plays an important role in web mining applications. We classify the existing node classifiers into Inductive and Transductive approaches. Among the Transductive methods, the Majority Rule method (MRM) has a prominent role. This method considers only the class labels of the neighboring nodes, neglecting the informative connectivity information in the graph data. In this paper, we propose an Augmented Random Walk (ARW) based approach to resolve main limitations of MRM. In our proposed method, first, we augment the initial graph by adding class labels as new nodes to the graph and then we connect each classified node to its corresponding class label nodes. Second, we apply a Random Walk algorithm to find the similarity score of each un-classified node to different class labels. Third, we predict class labels with the highest scores for the un-classified node. Empirical results show that our proposed method clearly outperforms the Majority Rule method in six graph datasets with high homophily.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Evaluating the Quality of a Network Topology through Random Walks

In this brief announcement we propose a distributed algorithm to assess the connectivity quality of a network, be it physical or logical. In large complex networks, some nodes may play a vital role due to their position (e.g. for routing or network reliability). Assessing global properties of a graph, as importance of nodes, usually involves lots of communications; doing so while keeping the ov...

متن کامل

Efficient algorithms for topology control and information dissemination/ retrieval in large scale Wireless Sensor Networks

Wireless Sensor Networks (WSNs) require radically new approaches for protocol/ algorithm design, with a focus towards energy efficiency at the node level. We propose two algorithms for energy-efficient, distributed clustering called Directed Budget Based (DBB) and Directed Budget Based with Random Delays (DBB-RD). Both algorithms improve clustering performance and overall network decomposition ...

متن کامل

On the Embeddability of Random Walk Distances

Analysis of large graphs is critical to the ongoing growth of search engines and social networks. One class of queries centers around node affinity, often quantified by random-walk distances between node pairs, including hitting time, commute time, and personalized PageRank (PPR). Despite the potential of these “metrics,” they are rarely, if ever, used in practice, largely due to extremely high...

متن کامل

LPKP: location-based probabilistic key pre-distribution scheme for large-scale wireless sensor networks using graph coloring

Communication security of wireless sensor networks is achieved using cryptographic keys assigned to the nodes. Due to resource constraints in such networks, random key pre-distribution schemes are of high interest. Although in most of these schemes no location information is considered, there are scenarios that location information can be obtained by nodes after their deployment. In this paper,...

متن کامل

Enhanced Random Walk with Choice: An Empirical Study

The random walk with choice is a well known variation to the random walk that first selects a subset of d neighbours nodes and then decides to move to the node which maximizes the value of a certain metric; this metric captures the number of (past) visits of the walk to the node. In this paper we propose an enhancement to the random walk with choice by considering a new metric that captures not...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2015